Gauge independence of the S-matrix in the causal approach 



Andreas Aste, Giinter Scharf 
Institut fur Theoretische Physik der Universitat Zurich, 
Winterthurerstr. 190, CH-8057 Zurich, Switzerland 

Michael Diitsch* 
II. Institut fur Theoretische Physik, Universitat Hamburg 
Luruper Chaussee 149, D-22761 Hamburg, Germany 

^ | September 30, 1997 



O 

O 
o 



Abstract 



The gauge dependence of the time-ordered products for Yang-Mills theories is analysed 
in perturbation theory by means of the causal method of Epstein and Glaser together with 
perturbative gauge invariance. This approach allows a simple inductive proof of the gauge 
, independence of the physical S-matrix. 
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1 Introduction 



The properties of (quantum) gauge invariance and gauge-parameter independence, which are 
inherent in all kinds of gauge theories, have always been of great interest. In the calculation 
of physical observables, i.e. S-matrix elements, the question of gauge parameter independence 
arises automatically. In the usual Lagrangean approach to quantum field theory, the gauge 
invariance of the classical Lagrangean has to be broken in order to quantize the theory. Therefore, 
gauge fixing terms which depend on free gauge parameters are added to the Lagrangean. The 
theory then still has BRS invariance [1]. The gauge parameters drop out in S'-matrix elements 
between physical states. But Green functions are gauge dependent in general. On the other 
hand, it can be shown that Green functions of the special class of gauge invariant operators are 
independent of the method of gauge fixing and then gauge-parameter independent [2]. 

In fact, the crucial property of gauge theories which allowed to show the gauge independence 
of physical S'-matrix elements by path integral methods is BRS invariance, holding for arbitrary 
gauge parameters. BRS invariance implies generalized Ward-Takahashi identities first proved 
by Slavnov and Taylor [3,4]. One considers then the generating functional W\( J) of the theory, 
where A is a gauge parameter and J the external source coupled to a physical field (e.g. a gauge 
or quark field). Changing the gauge parameter A by an infinitesimal amount d\ and using the 
Slavnov- Taylor identities, the desired result can easily be derived [5]. 

The property of gauge parameter independence has recently gained renewed interest also in 
practical problems. For example, the introduction of running couplings can only be achieved 
by a resummation of certain subsets of Feynman diagrams [6,7,8], and it is then necessary to 
define a general procedure for maintaining the gauge independence of the theory. Of course, 
the significance of such resummed objects is always questionable. Furthermore, the problem has 
also been discussed in the framework of the background-field model for the electroweak Standard 
Model [9]. 

It is the aim of this paper to describe the situation from a totally different point of view 
for the example of pure Yang-Mills theory, without making reference to path integral methods. 
Some years ago, some of us [10, 11] began to advocate the causal approach to perturbative 
quantum field theory, which goes back to a classical paper by H. Epstein and V. Glaser [12]. 
No ultraviolet divergences and only well-defined objects (no interacting fields) appear in this 
approach. Meanwhile, the method has been applied successfully to full Yang-Mills and massive 
theories as the electroweak Standard Model [13]. 

In the causal approach, the S-matrix is viewed as an operator-valued distribution of the 
following form: 



where g £ S, the Schwartz space of functions of rapid decrease. The T n are well defined time- 
ordered products of the first order interaction T±, which specifies the theory. For example, for 
QCD without matter fields one has 



where = d p ' A v a — d^A u a is the free field strength tensor and u a ,u a are the (fermionic) ghost 
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dxi... dx n T n (xi, ...x n )g{x 1 ) ■ ... g{x n ), 



(1.1) 



Ti(x) = igfabc{\ ■ A^(x)A ub (x)F^(x) : - : A^(x)u b (x)d"u c (x) :} 



(1.2) 
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fields. The asymptotic free fields satisfy the well-known commutation relations 

[A<£\x), A^\y)\ = isTD^\x - y) + i^(9^)W(x - y) (1.3) 
in the so-called A-gauges, and 

{u^(x),u^\y)} = -iD&\x-y), (1.4) 

where D and E will be defined below, and all other {anti-}commutators vanish. (For the gen- 
eralization to the massive case see [14].) The introduction of ghost fields is necessary already 
at first order to preserve perturbative quantum gauge invariance, which we are going to explain 
now. It can be written in our case by the help of an appropriately defined gauge charge Q: 

Q:=\J cPxd^ixjdouix). (1.5) 

This leads to the following gauge variations for the fields: 

[Q,A„]=id^u , [Q,F^}=0 , {Q,u} = , {Q,u} = -i\d^ . (1.6) 

Obviously, these variations have a simpler structure than those in the BRS case. Perturbative 
quantum gauge invariance is then expressed by the following condition: 

n 

[Q,T n (xi,...x n )] = i^dfiT^xi, ...x n ) = (sum of divergences) , (1.7) 
l=i 

where is a mathematically rigorous version of the time-ordered product 

T^Ori, ...,*„)» ="T(T 1 (x 1 )...T? /1 (x l )...T 1 (x n )) , (1.8) 

[Q,Ti(x)\ =: idvT^x) , (1.9) 

constructed by means of the method of Epstein and Glaser. Note that the usual 4-gluon term 
is missing in T\. This term is generated by quantum gauge invariance at second order of 
perturbation theory. 

The paper is organized as follows. In the next section we introduce the asymptotic gauge 
fields in the covariant A-gauges and discuss their relation for different A. Then in the third 
section we construct a concrete representation in momentum space which is useful for certain 
computations in the next section. There we discuss the physical subspace and prove gauge 
independence of the time-ordered products on the physical subspace. This result is a consequence 
of gauge invariance for any A. The latter property is investigated in section 5. Two appendices 
contain technical details. 

2 Asymptotic fields in covariant A-gauges 

We shall use asymptotic gauge fields satisfying the modified wave equation 

□ 4 A ) = (l-A)d M dMW. (2.1) 
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Here A is a real gauge parameter, A = 1 corresponds to the Feynman gauge. We have omitted 
color indices etc. which are unimportant in this section, only the Lorentz structure matters 
here. The upper index (A) indicates that the field corresponds to the gauge parameter A, and 
we are going to consider the fields with different A simultaneously. 

First we want to solve the Cauchy problem for (2.1) with Cauchy data specified at time t = 
in the whole R 3 . For this reason we isolate the highest time derivatives in (2.1) 

Ad 2 4 A) = A 4 A) + (1 " A)d ^'4 A) (2-2) 

dlAf ] = A Af ] + (1 - X)d j (d°A i X) + <9% (A) ). (2.3) 

Consequently, in agreement with the ordinary wave equation, the Cauchy data are given by 
A^\o,x) and (doA^)(0, x). Taking the divergence <9 M of (2.1) we get for A / 

□^' = 0, (2.4) 

so that A^ satisfies the iterated wave equation 

□ 2 4 A) =0. (2.5) 

The Cauchy problem for this equation is considered in the Appendix. The solution can be 
written in terms of the Lorentz invariant distributions D(x) and E(x): 

AW(x)= J d 3 yD(x-y)dlAP(y)+ J d 3 y E(x - y)d Q nA^ (y). (2.6) 

2/0=0 2/0=0 

Here, all second and third order time derivatives under the last integral must be expressed by 
spatial derivatives of the Cauchy data by means of (2.2) and (2.3). 

It is very important to notice that the decomposition (2.6) is Lorentz covariant. Indeed, 
instead of selecting the plan yo = we may consider a smooth space-like surface a with a surface 
measure da v (y). Then, with help of Gauss' theorem, the integrals in (2.6) can be written in 
invariant form 

/<-> r <r+V 

d 3 y...d ^ / da u (y)... d , 

2/0=0 cr 

showing that each term on the r.h.s. of (2.6) is a Lorentz four-vector. 

Let us denote the first term in (2.6) which satisfies the ordinary wave equation by A™(x) (w 
for wave). The second term denoted by is equal to 

B^x) = (1 - A)/ d 3 yE(x - y)d d^dAW(y), (2.7) 

where (2.1) has been inserted. For \i = j = 1,2, 3 the derivative can be taken out by partial 
integration, so that 

B j (x) = d jX (x) (2.8) 

is a spatial gradient with 

X (x) = (l-A) J d 3 yE(x-y)d dA^(y). (2.9) 

2/0=0 
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But this is impossible for the zeroth component 

B (x) = d oX (x)+B(x). (2.10) 
The difference B(x) can be transformed as follows 

B(x) = (1-A) J d 3 y [E(x - y)d d y dAW(y) - d£E(x - y)d dA^(y)] 

= (1 - A) J d 3 yd y [E(x - y)d dA^(y)} 
= (1 - A) J d?y[dlEdQdA {> ^ + Ed$dA^ - d^EdA™ - 8^Ed dA^} 

= _(i _ X )J d 3 yuE{x - y)dA^(y) = -(1 - A) J d 3 yD(x - y)d v A^\y). (2.11) 

This shows that the field B{x) also fulfills the wave equation nB = 0. Therefore, it is 
tempting to combine it with A'q(x). The resulting four-component field 

Ai = (A$ + B,A?,A%,A%) (2.12) 

satisfies the wave equation and we have the simple decomposition 

4 A >=ilJ + a M X. (2-13) 

However, this decomposition has the serious defect of not being covariant (see (2.12)). Therefore, 
we must make a sharp distinction between the field A^ and the covariant field A^ in the Feynman 
gauge A = 1, although both fields satisfy the wave equation and the same commutation relation, 
as we shall see. 

Next we want to quantize the A^ -field. It follows from (2.6) that the commutation relations 
for arbitrary times must involve the distributions D and E. Then, Poincare covariance and the 
singular order lo = — 2 of the resulting distribution suggest the following form 

[4 A )(x), Ai x \y)} = ig^D(x - y) + iad„d u E(x - y), (2.14) 

where a common factor (h/2-jr) has been set =1. When operating with ng Kfl — (1 — X)d K d fl on 
the variable x, we must get zero. This determines the parameter a. Using uE{x) = D(x), we 
find 

[AP(x], 4 x \y)] = i 9iXV D(x -y) + d^E(x - y). (2.15) 

The corresponding commutation relations for the positive and negative frequency parts read 
(see (1.3)) 

[AW-\x), AW(+\y)]=ig^D(+\x-y) + i^(d ll d v E)M(x-y). (2.16) 

Note that the positive frequency part of the derivative (d u E) is well defined, in contrast to E^ . 
^From the initial values of the D- and ^-distributions 

D(0, x) = 0, (d D)(0, x) = 5 3 (x) (2.17) 
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(&SE)(0,x) = 0, n = 0,1,2, (d$E){0,x) = 5 3 (x) (2.18) 
we obtain the equal-time commutation relations 

[doAP(x),A^(y)] = i 9ta/ {l + 9( JlZ±)5{x - y) (2.19) 

[d A { X \x),d A ( j X \y)] = i^lldjSix-y), (2.20) 



where the subscript means xo = yo- All other commutators are zero. It follows from (2.19) 
(2.20) that 3-dimensional smearing with a space dependent test function f(x) is sufficient to get 
a well-defined operator in Fock space. 

^,From the fundamental commutation relations (2.15) the commutators of all other fields can 
be calculated because they are all expressed by A^ . We find 

lx(x),x(y)]=o (2-21) 
[AJ(x),At(y)}=i 9jk D(x- y ) (2.22) 

[A%(x),A%(y)] = ±D(x-y) (2.23) 

[A L ^x),A^(y)]=ig^D(x-y). (2.24) 

Now, AJ(x),j = 1,2,3 are the spatial components of a covariant vector field satisfying the 
wave equation. The commutation relations (2.22) are the same as for the Feynman field Aj[x). 
Nevertheless, we cannot identify the two as we shall see in the next section by constructing a 
concrete representation of the field operators. 



3 Concrete representation in momentum space 

Most authors who consider the A-gauges leave the construction of a concrete representation to 
the reader. We try to be more polite to our readers. 

Since three-dimensional smearing is enough to render A^ (x) well defined, we will construct 
all fields as three-dimensional Fourier integrals, leaving aside manifest Lorentz covariance. Our 
strategy will be to start with a representation of the time-zero fields which satisfies the equal- 
time commutation relations (2.19) (2.20) and then calculating the time evolution by the formulae 
of the last section. We follow the somewhat unusual, but mathematically more satisfactory 
procedure of assuming a Fock space with positiv definite metric and changing the form of the 
zeroth component A^ instead [10]. This is very natural in the A-gauge because the zeroth 
component plays a special role here, anyway. 

We use the usual emission and absorption operators for all four components satisfying 

[aM(p),aW + (q)\ = 5^5(p- q). (3.1) 

The adjoint is defined with respect to the positive definite scalar product so that these operators 
can be represented in the usual way in a Fock space J-^\ if smeared out with test functions 
f(p)£ L 2 (R 3 ). In addition we will use the operators for the longitudinal mode 
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where always u = \p\ = p°. Introducing the linear combinations 



b[ X \tf = —(a\ X \p) + a {X \p)) 
bi X \p) = -L(^\p)-ai X \p)), 



(3.3) 



we have the following commutators 



[a^ + (p),b[ X \q)]=-^5(p-q). 



1 Pu u ^ 



(3.4) 



In addition to the adjoint we have to introduce a second conjugation K which appears in all 
Lorentz covariant expressions and defines the so-called Krein structure [17]. It is defined by 



ai X \p) K = -ai X \p)+, a (X \p) K = a (X \p}\ j = 1,2,3. 



(3.5) 



Note that a^\a^+ are not treated as four- vectors, therefore, we write the indices always 
downstairs. 



The gauge field A^\x) must be self-conjugated A)?'^ = A)?' . Then, a little experimentation 
shows that the time-zero fields must be of the following form: 

73, 



(X)K 



I (A) 



AW(0, x) = (2vr)" 3 / 2 / ^JL {««( P >- + af® ( 



-tpx 



1 - A 



^b (X) (p)e ipx + ^b {X)+ {p)e~ ips 

id LO 



2V2X 

-2g^b[ X \p)e^ - 2g, bi X)+ (p)e-^ 



(3.6) 



(d A^)(0,x) = -i(2n)- 3 / 2 



d 3 p 



{\)K, 



-ipx 



1-A 



2^2X . 

-2g^b{ X \^ + 2g^iob (X)+ (p)e-^ 



(3.7) 



It is straight-forward to verify the commutation relations (2.15) (2.19) (2.20). 

The fields for arbitrary times can now be found from (2.6). For this purpose we need the 
following three-dimensional Fourier transforms (for yo = 0) 



J d 3 yD(x - y)e m = ~^(e iuJX ° - e^^e*** 
J d 3 y E(x - y y™ = ^ [ e *"*° (x° + 1) +e~ iu3f> (x° - ^) 



(3.8) 
(3.9) 
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We first compute 

(0MW)(O,a?) = ^(2vr)- 3 / 2 | d 3 p ^{b {X \p)e^ x - b {X)+ [p)e^ x ) . 



(d d M 4 A) )(M) 



■V2 



(27T) 



-3/2 



Then we obtain from the first term in (2.6) 

d 3 p 



d 3 p 
x/2uj 



■{b[ X) (p)e^ + bi X)+ (p)e- 



ipx 



Af(x) = (2vr)- 3 / 2 



(af\p)e- tpx + af )+ {f)e ipx )+djf^ 



(3.10) 
(3.11) 

(3.12) 



where 



fix) = -i- 



1 - A 



d 3 p 



'4A(2vr) 3 /2 7 ujS/2 

The zeroth component behaves differently 

1-A 



b[ X) (-p)e ipx - b {X)+ {-p)e~ ipx 



a ( X) {p)e~ ipx - a£ n (p)e ipx 



-^(b[ X \p)e' ipx + b {X)+ (p)e^ x )_ 



(3.13) 



Next we calculate x( x ) from (2.9) 
1-A 



v (,) - ^(2vr)- 3 / 2 / [b[ X \p)e-^(x° - ^)+b^ + ®e*° (x° + £)} -f{x) (3.14) 



V2X 

and -B(x) from (2.11 



A-l 
' V2X 



B(x) = ^(2-)" 3/2 / ^ + 4 A)+ (p)e-) 

<i 3 p 



(27T) 



-3/2 



( A )(_^ e ^ + 6W + (-p) e -^). 



This cancels against the second line in (3.13) so that 



4 X \x) = (2vr)- 3 / 2 



d 3 p 



a [X \p)e~^ x - a {X)+ (p)e ipx ] +d ( X + /)■ 



(3.15) 



(3.16) 



The first integral in (3.16) and (3.12) formally agrees with the Feynman field A^, but the latter 



is defined by means of different annihilation and creation operators aj^\p), a)l n (p) 



,(!)+/ 



Af(x) = (2VT)' 3 / 2 / [aPtfe-** + a^e 



(3.17) 



In A^ the terms with wrong frequencies ~ b[ X \—p) etc. cancel out. Then the resulting 
decomposition A^ = Aj^ + <9 M x is identical with the one introduced by Lautrup [16]. 
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Until now every field operates in its own Fock space J-^\ But there must exist a 
A-independent intersection of these J-^ where the gauge independent objects live. Indeed, in 
the foregoing equations the A-dependence is only through the unphysical scalar and longitudinal 
modes 61,62 (3.3), all equations involving only transverse modes which can be written down 
contain no A. Therefore, we can safely identify the transverse emission and absorption operators 
for different A. Let e M = (0,s) and rf = (0,ff) be two transverse polarization vectors 

p-Hp) = = p-ri(p), e 2 = l = ff, e-ff=0. (3.18) 

Then we put 

e% x Hp) = a £ (p), ^(P) = a v (p) (3.19) 
independent of A. Choosing one unique vacuum O for all field operators 

a £ (p)n = = a v (p)n = b[ x) (p)n = b { 2 x \p)n = o 

for all p (or rather after smearing with test functions f(p)), then the different Fock spaces 
hang together (fig.l). Their intersection is the physical subspace H. p h ys which is spanned by the 
transverse states (a£) m (a+) n Q. 



4 Gauge invariance and gauge independence 

Now we come to the study of the nilpotent gauge charge Q\ (1.5) 

Qx = \J &xdPAW{x)d Q u{x), (4.1) 

where the color indices are always suppressed if the meaning is clear. The ghost fields u, u are 
quantized as follows 

□11 = 0, □u = 
{u a (x), u b (y)} = -i5 ab D(x - y). (4.2) 
Since there is no A-dependence here, they can be represented in the usual way 

u{x) = (2^ 3 / 2 J ^JL (c 2 (p)e-^ + c+(p)e^) (4.3) 
u(x) = (2vr)- 3 / 2 J (- Cl (p)e-^ + 4(p)e^) (4.4) 

where 

{c i (p),cf(q)}=5 ij 5(p-q), i,j = 1,2. 
The conjugation K is extended to the ghost sector by 

C2(P) K = Cl (p) + , Cl (p) K = c 2 (p) + (4.5) 

so that u K = u is i^-selfadjoint and u K = —u. Then Q\ (4.1), if densely defined, becomes 
.RT-symmetric Q\ C Q\- It is not necessary for the following to give an explicit description of 
the domain. According to a general result [18], it has a K-selfadjoint extension = Q\ which 
is a closed operator and this is all we need for our purpose. 
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Using (3.10), (3.11) and (4.3) it is easy to calculate Q\ in momentum space 

Q\ = V% J d 3 P u>(p)[h(p)c+(p) + b+(p)c 2 (p)}. (4.6) 

For typographical simplicity we have not written the A-dependence in b\, b 2 . Qx together with 
its adjoint 

Q+ = V2j d 3 pu(p)[ci(p)bt(p) + 4(p)b 2 (p)] (4.7) 

are unbounded closed operators; the unboundedness is not only due to the emission and absorp- 
tion operators but also because of to(p) = \p\. 

Since Q\, Q A are closed operators, we have the following direct decompositions of the Fock 

space 

F {X) = Ran Q\ © Ker = Ran Q| © Ker Q A , (4.8) 

where Ran is the range and Ker the kernel of the operator. The overline denotes the closure; 
note that Ran Q\ is not closed because is in the essential spectrum of Q\. Now, Q\ = implies 
RanC^A -L RanQ A , therefore, it follows from (4.8) that 

F {x) = Ran Q x © Ran Q+ © (Ker Q\ n Ker Q+) . (4.9) 

The range of Q\ and Q A certainly consists of unphysical states because (4.6) and (4.7) only 
contains emission operators of unphysical particles (scalar and longitudinal "gluons" and ghosts). 
The physical states must therefore be contained in the last subspace in (4.9). 
We claim that 

KerQ A nKerQ+ =Ker{Q A ,Q+} (4.10) 

where the curly bracket is the anticommutator. Indeed, if a vector / G J 7 ^ belongs to the l.h.s., 
that means Q\f = = Q\f then it is also contained in the r.h.s. Inversely, if / belongs to the 
r.h.s. then 

= (f,{Qx,Ql}f) = WQxfW 2 + \\Qlff, 

it is also contained in the l.h.s. Calculating the anticommutator from (4.6) (4.7) we find 

{Qx, Qj} = 2 / d 3 P Lo\p) [bt(p)h(p) + bt(p)b 2 (p) + ct(p) Cl (p) + 4(p)c 2 (p)] . (4.11) 

Up to the (positive) factor to 2 this is just the particle number operator of the unphysical particles. 
The physical subspace is characterized by the fact that there are no unphysical particles, hence, 

Wphys = Ker {Qa, Q\} (4.12) 

and this is a closed subspace. As discussed above, it is the intersection of all 

We introduce the projection operator P\ on H p h ys - It is our goal to prove the gauge in- 
dependence of the physical S-matrix P\S( x ^(g)P\. The perturbative formulation in terms of 
time-ordered products (1.1) would be 

P X T^P X = P 1 T^P 1 + div. (4.13) 

Here div denotes a sum of divergences which vanish after integration with test functions g{x\) 
. . . g(x n ) in the formal adiabatic limit where terms with derivatives of g are neglected. In (4.13) 
we have compared the physical n-point functions in the A-gauge with the Feynman gauge A = 1. 
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Gauge independence (4.13) is a direct consequence of gauge invariance (1.7). That (1.7) really 
holds for arbitrary A is discussed in the next section. Gauge invariance implies the following 
important proposition (see [14], eq.(5.28)) 

PT(X l )PT(X 2 )P = PT(X 1 )T(X 2 )P + div. (4.14) 

Here we have omitted indices n and subscripts A to indicate that (4.14) holds for arbitrary A and 
arbitrary n-point functions. For the sake of completeness we give a proof of (4.14) in Appendix 
2. 

The proof of gauge independence is by induction on n. The beginning n = 1 can be easily 
verified because P\A$ P x = P\A$ P\ and T\ (1.2) does not depend explicitly on A. Let us now 
assume that 

PxT^Px = PiT^Pi + div (4.15) 
holds for alH < n — 1. Then we consider arbitrary products 

P X T^(X 1 )T (X \X 2 )P X = P X T (X \X 1 )P X T (X \X 2 )P X + divi, (4.16) 

where we have used (4.14). Due to the induction assumption (4.15) this is equal to 

= P 1 T^ 1 \X 1 )P 1 T^(X 2 )P 1 + div 2 = P 1 T«(X 1 )T( 1 )(X 2 )P 1 + div 3 . (4.17) 

Here we have used (4.14) again. The causal D-distribution of order n in the Epstein-Glaser 
construction is a sum of such products (4.16), hence, it follows that 

P X D^P X = P 1 D^P 1 + div. (4.18) 

All three terms in here have separately causal support, therefore they can individually be split 
into retarded and advanced parts. The local nomalization terms can be chosen in such a way 
that 

P X R^P X = PiR^Pi + div, (4.19) 

where R denotes the retarded distributions. We must check that this way of normalization is 
not in conflict with the normalization which we adopt to achieve gauge invariance (see next 
section). But this is not the case for the following reason. We decompose 

T n = PT n P + W n . 

The condition (4.19) concerns the physical part PT n P, only. But the latter is gauge invariant 
for any normalization 

QPT n P - PT n PQ = 

because PQ = = QP. Therefore, the normalization in the proof of gauge invariance involves 
only the unphysical part W n . From the gauge independence of the retarded distributions (4.19) 
we get the same result for the n-point distributions 

P X T^P X = P 1 T<pP 1 + div (4.20) 

in the usual way. This completes the inductive proof. 
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5 Gauge invariance in an arbitrary A-gauge 



Gauge invariance (1.7) has been proven in the Feynman gauge A = 1 [11,15,19]. Here we 
summarize and reformulate that proof in a way which is manifestly independent of the choice 
of A. 

The generator Q\ (4.1) of the gauge transformations depends on A. But the parameter A 
drops out in the gauge variations of the free fields A^ , F^ u and u with the exception of u (1.6). 
However, we shall work with a ghost coupling (1.2) containing the field u a in the form d^Ua 
only. The gauge variation of the latter field can be written in a A-independent form 

{Q, d^u a } = -i\d^A au = id v F avii (5.1) 

by means of the equation of motion (2.1). 

1. Gauge invariance at first order 

The coupling (1.2) is gauge invariant at first order (1.9) with the 'Q-vertex' 

ZT/i(*) = *S/«6c[: A^{x)u b {x)F^{x) : -\ : u a {x)u b {x)d v u c {x) :]. (5.2) 

for any value of the gauge parameter A. The most general coupling which is gauge invariant at 
first order, symmetrical (Lorentz covariant, SU(N)-invariant, P-,T-,C-invariant, pseudo-unitary) 
and is compatible with renormalizability contains a non- uniqueness in the ghost sector [21] 

Ti + Pi{Q,gfabc ■ u a u b u c :} + f3 2 0^[igfabc ■ A%u b u c :], (5.3) 

Pi, S R arbitrary, and Ti is given by (1.2). We shall prove gauge invariance in the case 
Pi = = P2- For general values of Pi,P 2 € R a manifestly A-independent formulation is 
impossible, since the fields u a (without derivative) (1.6) and d^A^ appear in the coupling. (The 
'bad' behavior of duA£ is explained below in subsect.5.3) But it has been proven [20] that gauge 
invariance for Pi = = P2 implies gauge invariance for arbitrary Pi, P 2 £ R at least at low 
orders. The argumentation of that proof is of general kind, such that it applies to any choice of 
A. 

2. Outline of the proof of gauge invariance in feynman gauge A = 1 

In this subsect. we summarize the proof of gauge invariance (1.7) which was given for A = 1 in 
[11,15,19]. In the next subsect. we shall see that this proof needs no modifications for arbitrary 
A. The proof is by induction on the order n of the perturbation series. The operator gauge 
invariance (corresponding to (1.7)) of A' n ,R' n and D n = A' n — R' n , 

n 

[Q,D n (x 1 ,...,x n )}=iY / 0;'D^ /l (xi,...,x n ), (5.4) 
1=1 

has been proven in a straightforward way [11] from the gauge invariance of the T m , m < n — 1. 
This proof is very instructive because it shows that our definition (1.7) of gauge invariance is 
adapted to the inductive construction of the T n 's. However, the distribution splitting D n = 
Rn-A n can only be done in terms of the numerical distributions d n — r n a n . Therefore, we 
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have to express the operator gauge invariance (5.4) by the Cg-identities for D n , the C-number 
identities for gauge invariance, which imply the operator gauge invariance (5.4). 

However, there is a serious problem [11,19]. Terms with different field operators may com- 
pensate, due to identities like 

[u(xi) - n(x 2 )]<9^ 1 5(xi - x 2 ) + S(xi - x 2 )d fM u(xi) = 0. (5.5) 

Therefore the definition of the C-number distributions in R' n , A' n (and therefore in D n = R' n —A' n ) 
has a certain ambiguity because terms ~ (86) : A... : can mix up with terms ~ 6 : OA... :~ 5 : 
F... : .To get rid of these ambiguities, we choose the convention of only applying Wick's theorem 
(doing nothing else) to 

A' n (xi,...;x n ) = ^2f k (Y)T n _ k (Z,x n ), (5.6) 

Y,Z 

where the (already constructed) operator decompositions of T k , T n ^ k are inserted. In this way 
we obtain the so-called natural operator decomposition of A' n 

A' n = J2a' :0:, (5.7) 
o 

where the sum runs over all combinations O of free field operators. We similarly proceed with 
A' n ^, R' n and R' n n and define oIq == ' r$ — clq ■ Then, we split the numerical distributions d^} 

with respect to their supports into retarded and advanced parts d@ = r@ — o!q . Next we define 
t'$ = f r@ — r'$ and symmetrize it, which yields tg. The definition 

r„(/i)= E$:0:, (5.8) 
o 

gives T n yi) in the natural operator decomposition. Note that this procedure fixes the numerical 

distributions uniquely, up to the normalization in the causal splitting dp = r@ — a@ . 

Starting with the natural operator decomposition of D n (T n resp.) and (T^^), we 

commute with Q or take the divergence d^ 1 according to (5.4) and obtain the natural operator 
decomposition of (5.4) ((1.7) resp.). However, due to (5.5), the Cg-identities for D n cannot be 
proven directly by decomposing (5.4). We must go another way: Instead of proving the operator 
gauge invariance (1.7), we prove the corresponding Cg-identities (by induction on n), which are 
a stronger statement. In this framework the Cg-identities for D n can be proven by means of the 
Cg-identities for T k , T k in lower orders 1 < k < n — 1 . 

The Cg-identities for T n are obtained by collecting all terms in the natural operator decom- 
position of (1.7) which belong to a particular combination : O : of external field operators. By 
doing this the arguments of some field operators must be changed by using ^-distributions, i.e. 
by applying the simple identity 

: B{ Xi )0{X) : 5(x t - x k )... =: B{x k )0{X) : 5{ Xi - x k )... (5.9) 

where X = f (xi,X2, ...£„) and 0{X) means the external field operators besides B. 

We are now able to give a precise definition of the statement that the Cg-identities hold: We 
start with the natural operator decomposition of (1.7). Using several times the identity (5.9), 
we can obtain an operator decomposition 

n 

[Q,T n (X)\ - i^diT^X) = Y,Tj(X) : Oj(X) :, (5.10) 
l=i j 
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(where Tj(X) is a numerical distribution and : Oj(X) : a normally ordered combination of 
external field operators ) which fulfils 

Tj(X)=0, Vj. (5.11) 

The decomposition (5.10) must be invariant with respect to permutations of the vertices. 

A Cg- identity is uniquely characterized by its operator combination : O :. The terms in a 
Cg- identity are singular of order [11] 

\0\ + l (5.12) 

at x = 0, where 

\0\=A-b-g u -gu-d. (5.13) 

Here, b, g u , g^ are the number of gluons and ghost operators u, u, respectively, in O, and d is the 
number of derivatives on these field operators. 

There are no pure vacuum diagrams contributing to (1.7), i.e. terms with no external legs. 
The disconnected diagrams fulfil the Cg-identities separately. This can be proven easily by means 
of the Cg-identities for their connected subdiagrams, which hold by the induction hypothesis. 

Let us consider a connected diagram in the natural operator decomposition of (1.7). We 
call it degenerate, if it has at least one vertex with two external legs; otherwise it is called 
non- degenerate. Let Xi be the degenerate vertex with two external fields, say B\, B^. Such a 
'degenerate term' has the following form 

: B 1 (x i )B 2 (xi)B 3 (x jl )...B r (xj r _ 2 ) : A(xi - x k )t n -i(xi - x n , ...xl - x n , i - x n ), (5.14) 

where k / i, ji / i (V7 = 1, ...,r — 2) and the coordinate with bar in t n -\ must be omitted. 
In general, there is a sum of such terms (5.14) belonging to the fixed (degenerate) operator 
combination : O :=: Bi{xi)B2(xi)B^(xj 1 )...B r (xj r _ 2 ) :. For A(xj — x k ) the following possibilities 
appear: 

(a) A = D F , dD F , d^d u D F (// / v), d p d^d u D F (p ^ v ^ p ^ //), 

(b) A = 5( 4 \ 85^. 

The 85^ -terms in (b) cancel [15]. If a degenerate term (5.14) with A = 5^ (type (b)) 
can be transformed in a non-degenerate one by applying (possibly several times) the identity 
(5.9) only, we call it 8 -degenerate; if this is not possible we call it truly degenerate. All other 
degenerate terms (i.e. the terms of type (a)) are called truly degenerate, too. 

The truly degenerate terms fulfil the Cg-identities separately, by means of the Cg-identities for 
their subdiagrams (sect. 3.1 of [19]). The latter hold by the induction hypothesis. The exception 
are some tree diagrams in second and third order, which need an explicit calculation (sect. 3. 2 
of [19]). 

There remain the non-degenerate and (5-degenerate terms, which are linearly dependent. 
Therefore, the 5-degenerate terms must be transformed in non- degenerate form by using (5.9). 
In this way we obtain completely new Cg-identities, in contrast to the disconnected and the 
truly degenerate Cg-identities, which rely on Cg-identities in lower orders. Therefore, it is not 
astonishing that the difficult part of the proof of the Cg-identities concerns the non-degenerate 
: O : (including 5-degenerate terms). First one proves the Cg-identities of the non-degenerate 
and (5-degenerate terms for A' n , R' n (and therefore also for D n ) by means of the Cg-identities in 
lower orders (sect. 4.1 of [19]). In the process of distribution splitting the Cg-identities can be 
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violated by local terms only which are singular of order \0\ + 1 (5.12), i.e. the possible anomaly 
has the form 

\o\+i 

a( Xl ,...x n )= ]T C b D b 5 A{ - n - 1 \x 1 -x n ,...). (5.15) 
l*l=o 

We see that we only have to consider Cg-identities with 

\0\ > -1. (5.16) 

This occurs only for Cg-identities with 2-,3-,4-legs and one Cg-identity with 5-legs (: O :=: 
uAAAA :). For the latter the colour and Lorentz structures exclude an anomaly (5.15) [15]. 
For the Cg-identities with 2-, 3- and 4-legs we first restrict the constants C& in the ansatz (5.15) 
by means of covariance, the SU(N)-invariance and invariance with respect to permutations of 
the inner vertices. Then we remove the possible anomaly by finite renormalizations of the t- 
distributions in the Cg-identity. If a certain distribution t appears in several Cg-identities, the 
different normalizations of t must be compatible. For certain Cg-identities (: O :=: uAA : 
,: uAAA :,: uuduA :) the removal of the anomaly is only possible, if one uses additional 
information about the infrared behavior of the divergences with respect to inner vertices [15]. 

3. The modifications of the proof of gauge invariance for arbitrary A 

Going over to an arbitrary A-gauge there are two fundamental changes: 

(A) The wave equation for the free gauge field A% is replaced by (2.1). However in the proof 
of gauge invariance the equation of motion for A^ is used in (5.1) only. Therefore, by working 
always with {Q, d^u a } = id v F au ^ the modification of the equation of motion causes no changes 
in the proof of gauge invariance. 

(B) The commutator [vl^,^] (2.15) has an additional A-dependent term with the dipole 
distribution E. Similar changes appear in the positive and negative frequency part of (2.15), 
as well as in the retarded, advanced and Feynman propagator. All other commutators rsp. 
propagators are independent of A, e.g. [A afM , F bvT \. If we would work with another ghost coupling 
{Piifo) / (0,0) the field d^A^ would appear, which has s a A-dependent commutator with A v 

\d»A ail {x),A hv {y)\ = l -5 ah d u D(x - y). (5.17) 

We now have to check that the explicit form of the ylyl-commutator rsp. propagator is not used 
in the proof of the Cg-identities: 

- Second order tree diagrams: The explicit form of the propagators is used in the verification 
of gauge invariance for the second order tree diagrams. But gauge invariance can only be violated 
by local terms ~ (d)5(xi — X2). The latter can only appear if the propagator is of singular order 
to > —1 (see (5.12)). But the AA-propagator (without derivatives) has u> = —2 and, therefore, 
plays no role in this calculation. In all other propagators (with derivatives) the A-dependence 
drops out because the derivatives occur in the antisymmetric F, only. Especially we conclude 
that the four-gluon interaction (which is a normalization term of the second order tree diagram 
with external legs : A(xi)A(xi)A(x2)A(x2) : and is uniquely fixed by gauge invariance [11,19]) 
is independent of A and that it is the only local term in T^ltree- 

- (^-degenerate terms: If A(xi — Xk) in (5.14) originates from an AA-propagator (without 
derivatives), we know about the singular order a; (A) < u([A, A]) + 1 = —1. Therefore, A / S, 85 
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and the set of 5-degenerate terms is unchanged for A 7^ 1. Of course most t-distributions depend 
on A (due to (2.15)), but we conclude that the Cg-identities belonging to non- degenerate : O : 
(which include the (5-degenerate terms) are manifestly independent of A. (This is obvious for the 
non-degenerate terms.) 

- We turn to the proof of the Cg-identities belonging to non-degenerate : O : for a' and r' 
by means of the Cg-identities in lower orders (sect. 4.1 of [19]). There one has to show that the 
operator decomposition of [Q, A' n ] = [Q, J2 ^fc^n-fc] is unchanged if we interchange the operation 
[Q, .] with contracting. For this purpose one needs the explicit form of some propagators, but 
the AApropagator is not used. The non-trivial step is the cancellation of the terms arising by 
contracting the commutated leg. 

- The same cancellation is used in the proof of the Cg-identities for the truly degenerate 
terms by means of the Cg-identities for their subdiagrams (sect. 3.1 of [19]). Again the explicit 
form of the AApropagator plays no role. 

We emphasize that (A) and (B) are the only relevant changes for arbitrary A. Especially the 
singular order of the numerical distributions (5.12-13) and the symmetries (Lorentz covariance, 
SU(N)-invariance, P-,T- and C-invariance, pseudo-unitarity and invariance with respect to per- 
muatations of the vertices) are manifestly independent of A. Consequently, the ansatz (5.15) for 
the possible anomalies (in the Cg-identities belonging to non-degenerate : O :) remains the same 
and the constants C5 in (5.15) can be restricted in the same way. Moreover, the normalization 
polynomials of the t-distributions are unchanged and, therefore, we can use them to remove the 
anomalies in the same way. Finally gauge invariance of third order tree diagrams, which must 
be verified explicitly (sect. 3. 2 of [19]), and the proof of the non-trivial 5-legs Cg-identity [15] rely 
on the SU(N)-invariance and Lorentz covariance. Therefore, also this parts of the proof need no 
change. 

Summing up we see that the inductive proof of the Cg-identities is manifestly independent 
of A if we choose the ghost coupling f3\ = = (5-3) and always work with instead of 
d„A v (5.1). 

The coupling to fermionic matter fields (in the fundamental representation) can be added to 
this model. Gauge invariance holds true if and only if the coupling constants agree (universality 
of charge). This has been carried out in the Feynman gauge in [20]. There are no changes for 
arbitrary values of A. 

Appendix 1: Cauchy problem for the iterated wave equation 

First we formulate the Cauchy problem for the equation 

n 2 u={dl-dl-dl-dlfu = Q. (Al) 

Since (A.l) is of forth order in time xo = t, a complete set of Cauchy data at t = is given by 

(d%u)(0,x)=u n (x), n = 0,l,2,3. (A2) 

For simplicity we assume the u n to be in Schwartz space, then the initial- value problem (A.l) 
(A. 2) has a unique solution. This solution can be constructed by means of the tempered distri- 
butions D(x) and E(x), defined by 

□ D = 0, D(0,x) = 0, (d D)(0,x) = 5 3 (x), (A3) 
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a 2 E = 0, (d%E)(0,x) = 0, n = 0,1,2, (d$E)(0, x) = 8\x). (A4) 

D is the well known Pauli-Jordan distribution and E is sometimes called dipole distribution and 
we will soon compute it. 

We now claim that the solution of the Cauchy problem (A.l) (A. 2) is given by 

u{x) = j d 3 y D(x - y)ni(y) - d%D(x - y)u (y) + 

+E(x - y){u 3 - A Ul )(y) - d y Q E{x - y)(u 2 - Au ){y)] , (A5) 

where A denotes the three-dimensional Laplace operator. This formula is the same as the 
covariant equation (2.6) which is an obvious generalization of the solution of the ordinary wave 
equation. Using (A. 3) and (A. 4) it is a simple task to verify (A.l) and (A. 2). Therefore it 
remains to construct the dipole distribution E. 
^From (A. 3) and (A. 4) we get 

uE{x) = D{x) (A6) 

and we want to obtain E as solution of this equation. We solve this problem in momentum 
space. The Fourier transform of D is well known 



so that 



D{p) = —sgn Po S(p 2 ), (A7) 

Z7T 



p 2 E{p) = -—sgap 5(p 2 ). (A8) 



A solution of this equation can immediately be written down by means of the identity 



P 2 S'( P 2 ) = ^(P 2 S(P 2 ))-5(P 2 ) = -KP 2 ), (A9) 



namely 



E(p) = — sg np S'(p 2 ). (A10) 

By inverse Fourier transform the initial conditions (A. 4) can be verified and E(x) can be com- 
puted 

E(x) = ^ S gn(x )e(x 2 ). (All) 
Note that the positive frequency part 

"M^ipr = ^@(po)s\p 2 ) 

is ill-defined. This never occurs in rigorous calculations. Only derivatives of E have to be split 
into positive and negative frequency parts (see (1.3)) and these are well-defined. 
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Appendix 2 

Here we prove the relation (4.14). We start from the orthogonal direct decomposition (4.9) 
which can be written as 

1 = P Q + P Q+ +P (A12) 



where Pq and Pq+ are projection operators onto RanQ and RanQ + and P projects on TC p h y s- 
The operator (4.11) 

{Q,Q + } = R>0 (A13) 
is positive selfadjoint on the orthogonal complement Wp hys of 7i p h y s, so that it has an inverse 

KR- 1 = P Q + P q + = K~ X K, R- 1 P = 0. (A14) 

This allows to write (A. 12) in the form 

1 = P + QQ+R- 1 + Q+QK- 1 . (A15) 

Now we consider 

PT(X 1 )T(X 2 )P = PT(X 1 )(P + QQ+R- 1 + Q + QR- 1 )T(X 2 )P 

= PT{X 1 )PT(X 2 )P + PT(X 1 )QQ + R- 1 T(X 2 )P+ 

+PT(X 1 )Q + QR- 1 T(X 2 )P. (A16) 
Since PQ = 0, the second term is equal to 

P[T(X l ),Q)Q+R- 1 T(X 2 )P 

which is a divergence due to gauge invariance of T(X\). 

In the last term in (A. 16) we use the fact that R and, hence, R~ l commute with Q which 
follows easily from the definitions (A. 13), (4.11) and (4.6). Then we conclude that 

PT(X 1 )Q+K- 1 QT(X 2 )P = PT(Xi)Q+K- 1 [Q,T(X 2 )]P 

is also a divergence. Consequently, 

PT(Xi)T(X 2 )P = PT(X 1 )PT(X 2 )P + div 

which is the desired relation (4.14). 
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Figure Caption 

Figure 1. Relation between Fock spaces with different values of the gauge parameter A. 
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